Epidemic processes are common out-of-equilibrium phenomena of broad interdisciplinary interest.
Recently, dynamic message-passing (DMP) has been proposed as an efficient algorithm for simulating
epidemic models on networks [1], and in particular for estimating the probability that a given
node will become infectious at a particular time. To date, DMP has been applied exclusively to models
with one-way state changes, as opposed to models like SIS (susceptible-infectious-susceptible)
and SIRS (susceptible-infectious-recovered-susceptible) where nodes can return to previously inhabited
states. Because many real-world epidemics can exhibit such recurrent dynamics, we propose
a DMP algorithm for complex, recurrent epidemic models on networks. Our approach takes correlations
between neighboring nodes into account while preventing causal signals from backtracking
to their immediate source, and thus avoids “echo chamber effects” where a pair of adjacent nodes
each amplify the probability that the other is infectious. We demonstrate that this approach
approximates results obtained from Monte Carlo simulation well and that its accuracy is often superior
to the pair approximation (which also takes second-order correlations into account). Moreover, our
approach is more computationally efficient than the pair approximation, especially for complex epidemic
models: the number of variables in our DMP approach grows as $2mk$, where $m$ is the number
of edges and $k$ is the number of states, as opposed to $mk^2$ for the pair approximation. We suspect
that the resulting reduction in computational effort, as well as the conceptual simplicity of DMP,
will make it a useful tool in epidemic modeling, especially for inference tasks where there is a large
parameter space to explore.


I. INTRODUCTION 

Mathematical models of epidemic processes are intrinsically non-linear and multiplicative. These models
include the spread of disease, transmission of social behaviors, cascades of banking failures,
forest fires, the propagation of marginal probabilities in constraint satisfaction problems,
and the dynamics of magnetic and glassy systems.

The classical approach to modeling epidemics, such as the SIR model where each node is Susceptible,
Infectious, or Recovered, assumes that at any given time each individual exists in a single state or
“compartment.” To make these models analytically tractable, it is often assumed that the population is
well mixed, so that interaction between any two individuals is equally likely; in physical terms, we assume
the model is mean-field (also known as mass-action mixing in the epidemiology literature). Despite this
unrealistic assumption, mean-field models capture some essential features of epidemics, such as a threshold
above which we have an endemic phase with a non-zero fraction of infected individuals, and below which we
have outbreaks of size $o(n)$ so that the equilibrium fraction of infected individuals is zero.

In reality, contacts between individuals in the population are often highly structured, with some pairs of
individuals much more likely to interact than others due to location or demographics. To relax the
mean-field assumption, while retaining some measure of tractability, we can assume that individuals interact
on a network, whose structure captures the heterogeneity in the population [21, 22]. However, replacing the
mean-field approximation with a contact network substantially increases a model’s complexity.

One reasonable goal is to compute the one-point marginals, e.g., for each node $i$ the probability $I_i(t)$
that $i$ is infectious at time $t$. In addition to being of direct interest, these marginals help us perform tasks
such as inferring the originator of an epidemic, determining an optimal set of nodes to immunize in order
to minimize the final size of an outbreak, or calculating the probability that an entire group of nodes will
remain uninfected after a fixed time [25, 27].

We can always compute these marginals by performing Monte Carlo experiments. However, since we need
to perform many independent trials in order to collect good statistics, this is computationally expensive on
large networks. This problem is compounded if we need to scan through parameter space, or if we want 
to explore many different initial conditions, vaccination strategies, etc. Therefore, it would be desirable to 
compute these marginals using, say, a system of differential equations, with variables that directly model the 
probabilities of various events. 

The most naive way to do this, as we review below, uses the one-point marginals themselves as variables.
However, this approach completely ignores correlations between nodes. At the other extreme, to model the
system exactly, we would need to keep track of the entire joint distribution: but if there are $n$ individuals,
each of which can be in one of $k$ states, this results in a coupled system with $k^n$ variables. This exponential
scaling quickly renders most models computationally intractable, even on moderately sized networks.

In between these two extremes, we can approximate the joint distribution by “moment closure,” assuming
that higher-order marginals can be written in terms of lower-order ones. This gives a hierarchy of increasingly
accurate (and computationally expensive) approximations, familiar in physics as cluster expansions. At the
first level of this hierarchy we assume that the nodes are uncorrelated, and approximate two-point marginals
such as $[I_i(t) \wedge I_j(t)]$ (the probability that $i$ and $j$ are both infectious at time $t$) as $I_i(t) I_j(t)$. At the second
level, commonly referred to in the epidemiology literature as the pair approximation, we close the hierarchy
at the level of pairs $[I_i(t) \wedge I_j(t)]$ by assuming that three-point correlations can be factored in terms of
two-point correlations. For a comprehensive review of these methods, see [22, 25].

In this paper, we study an alternative method, namely Dynamic Message-Passing (DMP). As in belief
propagation [32, 33], here variables or “messages” are defined on a network’s directed edges: for instance,
$I_{j \to i}$ denotes the probability that $j$ was infected by one of its neighbors other than $i$, so that the epidemic
might spread from $j$ to $i$. However, unlike belief propagation, where the posterior distributions are updated
according to Bayes’ rule, here we write differential equations for the messages over time.

For many epidemic models, such as SI (susceptible-infectious), SIR (susceptible-infectious-recovered) and
SEIR (susceptible-exposed-infectious-recovered), only one-way state changes can occur. For example, in the
SIR model, once an individual has left the Susceptible class and become Infectious, they cannot return to
being Susceptible; once they become Recovered, they are immune to future infections, and might as well be
Removed. For these non-recurrent models, DMP is known to be an efficient algorithm to estimate $I_i(t)$,
and it is exact on trees [1]; it can also be applied to threshold models [3, 5] and used for inference [23].

However, for many real-world diseases individuals can return to previously inhabited states. In these
recurrent models, such as SIS (susceptible-infectious-susceptible), SIRS (susceptible-infectious-recovered-susceptible),
and SEIS (susceptible-exposed-infectious-susceptible), individuals can cycle through the states
multiple times, giving multiple waves of infection traveling through the population. The most obvious
examples of recurrent dynamics are seasonal influenza, where due to the evolution of the virus individuals are
repeatedly infected during their lifetime [41]; vaccination where protective immunity wanes over time [42]; and
diseases curable by treatment which does not result in antibody-mediated immunity, such as gonorrhea [47].
In all three cases, individuals leave the Susceptible class, only to return at some point in the future. (For
influenza, it is worth mentioning that if the evolutionary rate of the virus is functionally related to the
number of susceptible individuals, then the recovery rate may not be independent of the state of one’s
neighbors.) Unfortunately, the DMP approach of [1] cannot be directly extended to recurrent models, since
its equations for messages only track the first time an individual makes the transition to a given state.

The purpose of this paper is to develop a novel DMP algorithm for recurrent models of epidemics on
networks, which we call rDMP. We will show that rDMP gives very good approximations for marginal
probabilities on networks, and is often more accurate than the pair approximation. Moreover, whereas the
pair approximation requires keeping track of $mk^2$ variables, if there are $m$ edges and $k$ states per node,
rDMP requires just $2mk$ variables. For complex models where $k$ is large (for instance, for diseases with
multiple stages of infection or immunity, or multiple-disease epidemics where one disease makes individuals
more susceptible to another one), this gives a substantial reduction in the computational effort required.
Finally, the rDMP approach is conceptually simple, making it easy to write down the system of differential
equations for a wide variety of epidemic models.
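
To make the scaling concrete, the counts quoted above can be tabulated with a few lines of Python. This is only an arithmetic illustration: the function name `variable_counts` and the example sizes (a sparse network with 100 nodes and 150 edges) are assumptions chosen for the example.

```python
def variable_counts(n, m, k):
    """Number of dynamical variables needed by each description.

    n: number of nodes, m: number of undirected edges, k: states per node.
    """
    return {
        "exact joint distribution": k ** n,  # one probability per global configuration
        "pair approximation": m * k ** 2,    # one k-by-k joint table per edge
        "rDMP messages": 2 * m * k,          # one message per directed edge and state
        "one-point marginals": n * k,        # tracked alongside the messages
    }


# Hypothetical sparse network: 100 nodes, 150 edges, with 3 and then 10 states per node
for k in (3, 10):
    counts = variable_counts(n=100, m=150, k=k)
    print(k, counts["pair approximation"], counts["rDMP messages"])
```

The gap between the two closures grows linearly in $k$, so the saving matters most for models with many states.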

FIG. 1. We define messages on the directed edges of a network to carry causal information of the flow of contagion,
e.g. $I_{j \to i}$ is the probability that $j$ is Infectious because it received the infection from a neighbor $k$ other than $i$. This
prevents effects from immediately backtracking to the node they came from, and avoids “echo chamber” infections.

FIG. 2. Two simple, yet illustrative, cases of networks, where the darker node is initially Infectious. As we discuss,
in these simple cases one can see the motivation for our approach, which prevents infection signals from backtracking to
the node they immediately came from.


II. MESSAGE-PASSING AND PREVENTING THE ECHO CHAMBER EFFECT 

As shown in Fig. 1, the variables of rDMP are messages along directed edges of the network (in addition
to one-point marginals). For instance, $I_{j \to i}$ is the probability that $j$ is Infectious because it was infected by
one of its other neighbors $k$. The intuition behind this is the following, where we take the SIS model as an
example. If $i$ is Susceptible, the rate at which $j$ will infect $i$ is proportional to the probability $I_j$ that $j$ is
infected. But when computing this rate, we only include the contribution to $I_j$ that comes from neighbors
other than $i$. In other words, we deliberately neglect the event that $j$ receives the infection from $i$, and
immediately passes it back to $i$, even if $i$ has become Susceptible in the intervening time.

This choice avoids a kind of “echo chamber” effect, where neighboring nodes artificially amplify each
other’s probability of being Infectious. For instance, consider a simple but pathological case of the SI model
where there are only two nodes in the graph, $i$ and $j$, with an edge between them as shown in Fig. 2. If
the transmission rate is $\lambda$, and if we assume the nodes are independent (i.e., if we use first-order moment
closure) we obtain the following differential equations,

$$
\frac{dI_i}{dt} = \lambda S_i I_j , \qquad
\frac{dI_j}{dt} = \lambda S_j I_i , \tag{1}
$$

where $S_i(t) = 1 - I_i(t)$ and similarly for $j$.

Now suppose that $j$ is initially Infectious with probability $\delta$, and that $i$ is initially Susceptible, i.e., $I_j(0) = \delta$
and $I_i(0) = 0$. Since in the SI model nodes never recover, the infection will eventually spread from $j$ to $i$,
but only if $j$ was Infectious in the first place. Thus the marginals $I_i(t)$ and $I_j(t)$ should tend to $\delta$ as $t \to \infty$.

However, integrating Eq. (1) gives a different result. Once $I_i$ becomes positive, $dI_j/dt$ becomes positive
as well, allowing $i$ to infect $j$ with the infection that it received from $j$ in the first place. As a result, $I_j(t)$
approaches 1 as $t \to \infty$. Thus the “echo chamber” between $i$ and $j$ leads to the absurd result that $j$ eventually
becomes Infectious, even though with probability $1 - \delta$ there was no initial infection in the system.
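
The echo chamber in Eq. (1) is easy to reproduce numerically. The sketch below is illustrative only: the function name, the forward-Euler step, and the parameter values ($\lambda = 0.1$, $\delta = 0.2$) are assumptions made for the example.

```python
def integrate_independent_si(lam=0.1, delta=0.2, t_max=500.0, dt=0.01):
    """Forward-Euler integration of Eq. (1) for two nodes i and j.

    lam is the transmission rate; delta is the initial probability
    that j is Infectious, while i starts out Susceptible.
    """
    I_i, I_j = 0.0, delta
    for _ in range(round(t_max / dt)):
        # Each node is driven by the full marginal of the other node
        dI_i = lam * (1.0 - I_i) * I_j
        dI_j = lam * (1.0 - I_j) * I_i
        I_i, I_j = I_i + dt * dI_i, I_j + dt * dI_j
    return I_i, I_j


I_i, I_j = integrate_independent_si()
print(f"I_i = {I_i:.4f}, I_j = {I_j:.4f}")  # both end far above delta = 0.2
```

Both marginals drift toward 1 rather than settling at $\delta$, which is the artifact described above.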

In the rDMP approach, we fix this problem by replacing $I_i$ and $I_j$ with the messages they send each other,

$$
\frac{dI_i}{dt} = \lambda S_i I_{j \to i} , \qquad
\frac{dI_j}{dt} = \lambda S_j I_{i \to j} ,
$$

so that $i$ can only infect $j$ if $i$ received the infection from some node other than $j$. (Below we give the
equations on a general network, including the time derivatives of the messages.) In this example, there are
no other nodes, so the messages do not change in time: if $I_{j \to i}(0) = \delta$ and $I_{i \to j}(0) = 0$, then $I_j(t) = \delta$
for all $t$, as it should be.

Note that we do not claim that rDMP is exact in this case. In particular, as in Eq. (1), $I_i(t)$ tends to 1 as
$t \to \infty$. This is because, unlike the system of [1], rDMP assumes that the events that $j$ infects $i$ at different
times are independent.
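
For comparison, the same two-node example can be integrated with messages in place of marginals. This is a sketch under stated assumptions: the function name and forward-Euler step are ours, and the messages are held fixed because in the SI model on two nodes neither node has a neighbor other than its partner to feed its outgoing message.

```python
def integrate_rdmp_si_pair(lam=0.1, delta=0.2, t_max=500.0, dt=0.01):
    """Two-node SI model where each node is driven by the incoming message."""
    I_i, I_j = 0.0, delta
    # Messages I_{j->i} and I_{i->j}; constant here since no third node exists
    msg_ji, msg_ij = delta, 0.0
    for _ in range(round(t_max / dt)):
        dI_i = lam * (1.0 - I_i) * msg_ji
        dI_j = lam * (1.0 - I_j) * msg_ij
        I_i, I_j = I_i + dt * dI_i, I_j + dt * dI_j
    return I_i, I_j


I_i, I_j = integrate_rdmp_si_pair()
print(f"I_i = {I_i:.4f}, I_j = {I_j:.4f}")  # I_j stays at delta; I_i still grows toward 1
```

The marginal of $j$ no longer inflates, while $I_i$ still approaches 1, in line with the caveat that rDMP is not exact in this case.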

In this two-node example, of course, the pair approximation is exact, since it maintains separate variables
such as $[S_j \wedge I_k]$ for each of the joint states of the two nodes. However, the pair approximation is subject to
other forms of the echo chamber effect. Consider a network with three nodes, as in Fig. 2 (right), where $j$ is a
common neighbor of $i$ and $k$. The pair approximation assumes that, conditioned on the state of $j$, the states
of $i$ and $k$ are independent; however, in a recurrent epidemic model, $i$ and $k$ could be correlated, for instance
if $j$ infected them both and then returned to the Susceptible state. As a result, the pair approximation is
vulnerable to a distance-two echo chamber, where $i$ and $k$ infect each other through $j$. As in the two-node
case, rDMP prevents this.

Preventing backtracking completely may seem like a strong assumption, and in recurrent models it is
a priori possible, for instance, for a node to re-infect the neighbor it was infected by. Despite the
well-documented importance of recurrent infections for diseases including (but certainly not limited to) seasonal
influenza [41], Plasmodium malaria [49], and urinary tract infections [48], little is known about the source of
recurrent infections. For certain sexually transmitted diseases such as gonorrhea [47] and repeated ringworm
infections [50], there is evidence that backtracking plays a significant role; on the other hand, it may be that
recurrent infections are caused by different strains, each of which is acting essentially without backtracking.
Thus while our non-backtracking assumption is clearly invalid in some cases, we believe it is a reasonable
approach for most recurrent-state infections.


III. THE rDMP EQUATIONS FOR THE SIS, SIRS, AND SEIS MODELS 


In this section, we illustrate the rDMP approach for several recurrent epidemic models. We start with the
simplest one: in the SIS model, each node is either Infectious (I) or Susceptible (S). Infectious nodes infect
their Susceptible neighbors at rate $\lambda$, and their infections wane back into the Susceptible state at rate $\rho$. We
denote the probability that node $i$ is Infectious or Susceptible by $I_i$ and $S_i$ respectively. The objective
then is to efficiently and accurately compute these probabilities as a function of time $t$.

We define variables or “messages” that live on the directed edges $(i, j)$ of the network. The directed nature
of these messages prevents infection from backtracking from an Infectious node back to its infection source,
e.g., if node $i$ infects node $j$, then we prevent $j$ from re-infecting $i$. In addition to tracking the one-point
marginal $I_j$, we define a message $I_{j \to i}$ from $j$ to $i$ as the probability that $j$ is in the Infectious state as a result
of being infected by one of its neighbors other than $i$. Given these incoming messages, the rate at which
$I_i$ evolves in time is given by

$$
\frac{dI_i}{dt} = -\rho I_i + \lambda S_i \sum_{j \in \partial i} I_{j \to i} , \tag{2}
$$

where $\partial i$ denotes the neighbors of $i$. Similarly, the rate at which $I_{j \to i}$ evolves in time is given by

$$
\frac{dI_{j \to i}}{dt} = -\rho I_{j \to i} + \lambda S_j \sum_{k \in \partial j \setminus i} I_{k \to j} , \tag{3}
$$

where $k \in \partial j \setminus i$ denotes the neighbors of $j$ excluding $i$.
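
Equations (2) and (3) translate directly into a short integration loop. The sketch below is illustrative and is not the code used for the experiments reported later: the function name `rdmp_sis`, the adjacency-dictionary input, the forward-Euler step, and the choice of starting each message at the sender's initial marginal are all assumptions made for the example.

```python
def rdmp_sis(adj, infected0, lam, rho, t_max, dt=0.01):
    """Integrate the SIS rDMP equations (2)-(3) with forward Euler.

    adj: dict mapping each node to a list of its neighbors (undirected graph).
    infected0: dict mapping node -> initial probability of being Infectious.
    Returns a dict mapping node -> I_i(t_max).
    """
    I = {i: infected0.get(i, 0.0) for i in adj}
    # Assumption: each message starts at the sender's marginal, I_{j->i}(0) = I_j(0)
    msg = {(j, i): I[j] for j in adj for i in adj[j]}
    for _ in range(round(t_max / dt)):
        # Sum of all messages arriving at each node j
        incoming = {j: sum(msg[(k, j)] for k in adj[j]) for j in adj}
        # Eq. (2): one-point marginals, with S_i = 1 - I_i
        new_I = {
            i: I[i] + dt * (-rho * I[i] + lam * (1.0 - I[i]) * incoming[i])
            for i in adj
        }
        # Eq. (3): the message j -> i ignores what j received from i
        new_msg = {}
        for (j, i), m in msg.items():
            inflow = incoming[j] - msg[(i, j)]
            new_msg[(j, i)] = m + dt * (-rho * m + lam * (1.0 - I[j]) * inflow)
        I, msg = new_I, new_msg
    return I


# Usage on a triangle with a single initially Infectious node
triangle = {0: [1, 2], 1: [0, 2], 2: [0, 1]}
print(rdmp_sis(triangle, {0: 1.0}, lam=0.1, rho=0.05, t_max=50.0))
```

Subtracting the reverse message from the total inflow implements the sum over $\partial j \setminus i$ without looping over neighbors a second time.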

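For the SIRS model, we let $\rho$ and $\gamma$ denote the transition rates from Infectious to Recovered and from
Recovered to Susceptible respectively. Then the rDMP system for the SIRS model is given by

$$
\frac{dI_{j \to i}}{dt} = -\rho I_{j \to i} + \lambda S_j \sum_{k \in \partial j \setminus i} I_{k \to j} , \tag{4}
$$

which is coupled with the one-point marginals through

$$
\begin{aligned}
\frac{dS_i}{dt} &= \gamma R_i - \lambda S_i \sum_{j \in \partial i} I_{j \to i} , \\
\frac{dI_i}{dt} &= -\rho I_i + \lambda S_i \sum_{j \in \partial i} I_{j \to i} , \\
\frac{dR_i}{dt} &= \rho I_i - \gamma R_i .
\end{aligned} \tag{5}
$$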
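
The SIRS system (4)-(5) can be sketched in the same style. As before, this is an illustration under stated assumptions rather than a reference implementation: the function name `rdmp_sirs`, the forward-Euler step, and the initialization (messages start at the sender's marginal, nobody starts Recovered) are choices made for the example.

```python
def rdmp_sirs(adj, infected0, lam, rho, gamma, t_max, dt=0.01):
    """Integrate the SIRS rDMP equations (4)-(5) with forward Euler.

    adj: dict node -> list of neighbors; infected0: dict node -> I_i(0).
    Returns three dicts (S, I, R) holding the marginals at t_max.
    """
    I = {i: infected0.get(i, 0.0) for i in adj}
    S = {i: 1.0 - I[i] for i in adj}
    R = {i: 0.0 for i in adj}  # assumption: no node is Recovered at t = 0
    msg = {(j, i): I[j] for j in adj for i in adj[j]}  # assumed initial messages
    for _ in range(round(t_max / dt)):
        incoming = {j: sum(msg[(k, j)] for k in adj[j]) for j in adj}
        # Eq. (4): message j -> i excludes the contribution sent by i
        new_msg = {
            (j, i): m + dt * (-rho * m + lam * S[j] * (incoming[j] - msg[(i, j)]))
            for (j, i), m in msg.items()
        }
        # Eq. (5): one-point marginals
        new_S, new_I, new_R = {}, {}, {}
        for i in adj:
            force = lam * S[i] * incoming[i]
            new_S[i] = S[i] + dt * (gamma * R[i] - force)
            new_I[i] = I[i] + dt * (-rho * I[i] + force)
            new_R[i] = R[i] + dt * (rho * I[i] - gamma * R[i])
        S, I, R, msg = new_S, new_I, new_R, new_msg
    return S, I, R


triangle = {0: [1, 2], 1: [0, 2], 2: [0, 1]}
S, I, R = rdmp_sirs(triangle, {0: 1.0}, lam=0.1, rho=0.05, gamma=0.2, t_max=50.0)
print(S[1], I[1], R[1])
```

Because the three right-hand sides in (5) sum to zero, the update conserves $S_i + I_i + R_i$ for every node.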

In the SEIS model, upon becoming exposed to an infected neighbor, Susceptible nodes first go through a
latent period called the Exposed state. In this state, individuals are infected but not yet Infectious. Exposed
nodes become Infectious at rate $\varepsilon$, and Infectious nodes again wane back to Susceptible at rate $\rho$. The
rDMP system for the SEIS model is

$$
\begin{aligned}
\frac{dE_{j \to i}}{dt} &= -\varepsilon E_{j \to i} + \lambda S_j \sum_{k \in \partial j \setminus i} I_{k \to j} , \\
\frac{dI_{j \to i}}{dt} &= -\rho I_{j \to i} + \varepsilon E_{j \to i} ,
\end{aligned} \tag{6}
$$

which is coupled with the one-point marginals as

$$
\begin{aligned}
\frac{dS_i}{dt} &= \rho I_i - \lambda S_i \sum_{j \in \partial i} I_{j \to i} , \\
\frac{dI_i}{dt} &= -\rho I_i + \varepsilon E_i , \\
\frac{dE_i}{dt} &= -\varepsilon E_i + \lambda S_i \sum_{j \in \partial i} I_{j \to i} .
\end{aligned} \tag{7}
$$

Note that here we track messages for the Exposed state, in addition to one-point marginals, since they act
as precursors for the Infectious messages. There is no need to track messages for the Susceptible state, since
it does not cause state changes in its neighbors.

Generalizing these equations to more complex epidemic models with $k$ different states, as opposed to three
or four, is straightforward. Even in a model where every state can cause state changes in its neighbors (for
instance, where having Susceptible neighbors speeds up the rate of recovery, or where Exposed nodes can
also infect their neighbors at a lower rate), the total number of variables we need to track in a network
with $n$ nodes and $m$ edges is at most $2mk$ in addition to the $nk$ one-point marginals. In contrast, the pair
approximation requires $mk^2$ variables to keep track of the joint distribution of every neighboring pair.
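
The SEIS system (6)-(7) follows the same pattern, now with two messages per directed edge. The sketch below is illustrative only: the function name `rdmp_seis`, the forward-Euler step, and the initial condition (no node starts Exposed, and Infectious messages start at the sender's marginal) are assumptions made for the example.

```python
def rdmp_seis(adj, infected0, lam, rho, eps, t_max, dt=0.01):
    """Integrate the SEIS rDMP equations (6)-(7) with forward Euler.

    adj: dict node -> list of neighbors; infected0: dict node -> I_i(0).
    Returns three dicts (S, E, I) holding the marginals at t_max.
    """
    I = {i: infected0.get(i, 0.0) for i in adj}
    E = {i: 0.0 for i in adj}  # assumption: no node is Exposed at t = 0
    S = {i: 1.0 - I[i] for i in adj}
    msg_I = {(j, i): I[j] for j in adj for i in adj[j]}  # assumed initial messages
    msg_E = {edge: 0.0 for edge in msg_I}
    for _ in range(round(t_max / dt)):
        incoming = {j: sum(msg_I[(k, j)] for k in adj[j]) for j in adj}
        new_msg_E, new_msg_I = {}, {}
        for (j, i) in msg_I:
            # Eq. (6): exposure of j caused by neighbors other than i
            inflow = lam * S[j] * (incoming[j] - msg_I[(i, j)])
            new_msg_E[(j, i)] = msg_E[(j, i)] + dt * (-eps * msg_E[(j, i)] + inflow)
            new_msg_I[(j, i)] = msg_I[(j, i)] + dt * (
                -rho * msg_I[(j, i)] + eps * msg_E[(j, i)]
            )
        new_S, new_E, new_I = {}, {}, {}
        for i in adj:
            # Eq. (7): one-point marginals
            force = lam * S[i] * incoming[i]
            new_S[i] = S[i] + dt * (rho * I[i] - force)
            new_I[i] = I[i] + dt * (-rho * I[i] + eps * E[i])
            new_E[i] = E[i] + dt * (-eps * E[i] + force)
        S, E, I, msg_E, msg_I = new_S, new_E, new_I, new_msg_E, new_msg_I
    return S, E, I


triangle = {0: [1, 2], 1: [0, 2], 2: [0, 1]}
S, E, I = rdmp_seis(triangle, {0: 1.0}, lam=0.1, rho=0.05, eps=0.2, t_max=50.0)
print(S[1], E[1], I[1])
```

Only the Exposed and Infectious states carry messages, matching the remark that Susceptible messages are unnecessary.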


IV. EXPERIMENTS IN REAL AND SYNTHETIC NETWORKS 


FIG. 3. Results on the SIS model. On the left, the marginal probability that node 29 in Zachary’s Karate Club (see
inset on right) is Infectious as a function of time. We compare the true marginal derived by $10^5$ independent Monte
Carlo simulations with that estimated by rDMP, the independent node approximation, and the pair approximation.
On the right is the $L_1$ error, averaged over all nodes; we see that rDMP is the most accurate of the three methods.
Here the transmission rate is $\lambda = 0.1$, the waning rate is $\rho = 0.05$, and vertex 0 (colored red) was initially infected.

FIG. 4. A scatterplot of the steady-state marginals $I_i$ for the n = 33 nodes in Zachary’s Karate Club, with the
same parameters as in Fig. 3. The vertical axis is the true marginal computed by Monte Carlo simulations; the
horizontal axis is the estimated marginals from rDMP (black *) and the pair approximation (blue ×). Both methods
overestimate the marginal, but rDMP is closer to the true value (the line $y = x$) for every node.

In this section we report on numerical experiments for rDMP for the SIS and SIRS models on real and
synthetic networks. As a performance metric, we use the average $L_1$ error per node between the marginals
computed from rDMP and the true probabilities computed (up to sampling error) using continuous-time
Monte Carlo simulations. That is,

$$
L_1^{\rm DMP}(t) = \frac{1}{n} \sum_{i} \left| I_i^{\rm MC}(t) - I_i^{\rm DMP}(t) \right| . \tag{8}
$$

We use this metric to compare the performance of rDMP with the independent-node approximation and
the pair approximation, or equivalently first- and second-order moment closure [22, 35]. As we will see,
for a wide range of parameters, rDMP is more accurate than either of these approaches, even though it is
computationally easier than the pair approximation.
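
The error metric of Eq. (8) is a plain average of absolute differences at a fixed time. A minimal sketch follows; the function name `l1_error` and the list-based interface are assumptions made for the example.

```python
def l1_error(true_marginals, estimated_marginals):
    """Average L1 error per node, as in Eq. (8), at a single time t.

    Both arguments are sequences of per-node probabilities in the same node order.
    """
    if len(true_marginals) != len(estimated_marginals):
        raise ValueError("both sequences must have one entry per node")
    n = len(true_marginals)
    return sum(abs(a - b) for a, b in zip(true_marginals, estimated_marginals)) / n


# Two-node toy example with made-up marginals
print(l1_error([0.5, 0.2], [0.4, 0.4]))
```

The same function applies unchanged to the independent-node and pair approximations, since only the estimated marginals differ.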


FIG. 5. The difference between $L_1^{\rm DMP}$ and $L_1^{\rm pair}$ on Zachary’s Karate Club as a function of time $t$, for various
values of the ratio $\rho/\lambda$. We rescale time so that $\lambda = 0.1$ as before. In the blue region, $L_1^{\rm DMP} < L_1^{\rm pair}$ and
rDMP is more accurate; in the red region, $L_1^{\rm DMP} > L_1^{\rm pair}$. We see that rDMP is more accurate except at early
times or when $\rho/\lambda$ is small.


In Fig. 3 we show results for the SIS model on Zachary’s Karate Club [34]. On the left, we show the
marginal probability that a particular node is Infectious as a function of time, estimated by rDMP and
by first- and second-order moment closure, and compared with the true marginals given by Monte Carlo
simulation. On the right, we show the average $L_1$ error for the three methods. Here $\lambda = 0.1$, $\rho = 0.05$, and
the initial condition consists of a single infected node (shown in red in the inset). The Monte Carlo results
were averaged over $10^5$ runs. We see that rDMP is significantly more accurate than the other two, except
at some early times when the pair approximation marginally outperforms rDMP.
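
For readers who wish to reproduce the Monte Carlo baseline, a continuous-time SIS simulation can be sketched with a Gillespie-style event loop. This is an assumption about how such a simulation might be written, not a description of the code behind the figures; the function names, the adjacency-dictionary input, and the use of Python's `random` module are all choices made for the example.

```python
import random


def gillespie_sis(adj, seed_node, lam, rho, t_max, rng):
    """One continuous-time SIS run; returns the set of Infectious nodes at t_max."""
    infected = {seed_node}
    t = 0.0
    while infected:
        # Every Susceptible-Infectious edge transmits at rate lam
        si_edges = [(i, j) for i in infected for j in adj[i] if j not in infected]
        recovery_rate = rho * len(infected)
        total_rate = recovery_rate + lam * len(si_edges)
        if total_rate == 0.0:
            break  # nothing can happen any more (e.g. rho = 0 and all nodes infected)
        t += rng.expovariate(total_rate)
        if t > t_max:
            break
        if rng.random() < recovery_rate / total_rate:
            infected.remove(rng.choice(sorted(infected)))  # a node recovers
        else:
            infected.add(rng.choice(si_edges)[1])  # a Susceptible neighbor is infected
    return infected


def estimate_marginals(adj, seed_node, lam, rho, t_max, runs, seed=0):
    """Fraction of runs in which each node is Infectious at time t_max."""
    rng = random.Random(seed)
    counts = {i: 0 for i in adj}
    for _ in range(runs):
        for i in gillespie_sis(adj, seed_node, lam, rho, t_max, rng):
            counts[i] += 1
    return {i: c / runs for i, c in counts.items()}


triangle = {0: [1, 2], 1: [0, 2], 2: [0, 1]}
print(estimate_marginals(triangle, 0, lam=0.1, rho=0.05, t_max=20.0, runs=1000))
```

The sampling error of each estimated marginal shrinks roughly as the inverse square root of the number of runs, which is why large run counts are needed.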

As a further illustration, in Fig. 4 we show the steady-state marginal $I_i$ for each node $i$ (measured by
running the system until $t = 50$, at which point $I_i(t)$ is nearly constant), with the same parameters and
initial condition as in Fig. 3. We show the true marginal of each node on the y-axis, and the marginals
estimated by rDMP and the pair approximation on the x-axis. If the estimated marginals were perfectly
accurate, the points would fall on the line $y = x$. Both methods overestimate the marginals to some extent,
but rDMP is more accurate than the pair approximation on every node. Thus rDMP makes accurate
estimates of the marginals on individual nodes, as opposed to just the average across the population.

To investigate how rDMP compares with the pair approximation across a broader range of parameters,
in Fig. 5 we vary the ratio between the waning rate $\rho$ and the transmission rate $\lambda$. Since we can always rescale
time by multiplying $\lambda$ and $\rho$ by the same constant, we do this by holding $\lambda = 0.1$ as before, and varying $\rho$.
We then measure the difference in the $L_1$ error of the two methods, $L_1^{\rm DMP} - L_1^{\rm pair}$.

In the blue region, rDMP is more accurate than the pair approximation; in the red region, it is less so.
We see that rDMP is more accurate except at early times (as in Fig. 3) or when $\rho$ is small compared to $\lambda$,
i.e., if the model is close to the SI model where Infectious nodes rarely become Susceptible again.

In Fig. 6 we simulate the SIS model on an Erdős-Rényi graph with $n = 100$ and average degree 3, with
$\lambda = 0.4$, $\rho = 0.1$, and a single initially Infectious node. As with the Karate Club, rDMP does a better job of
tracking the true fraction of Infectious nodes, except at early times when the pair approximation is superior;
in particular, it does a better job of computing the steady-state size of the epidemic.

In Fig. 7 we show results for the SIRS model on Zachary’s Karate Club. As in Fig. 3, on the left we show
the marginal probability $I_{29}$ that node 29 is Infectious; on the right, we show the $L_1$ error for $I_i$ averaged
over the network. In the insets, we show the marginal probability $R_{29}$ for the Recovered state and the
corresponding average $L_1$ error. Here the transmission rate is $\lambda = 0.1$, the waning rate from Infectious to
Recovered is $\rho = 0.05$, and the rate from Recovered to Susceptible is $\gamma = 0.2$. The initial condition consisted
of a single infected node, and Monte Carlo results were averaged over $10^5$ runs. As for the SIS model, rDMP
is significantly more accurate than the independent node approximation, and is more accurate than the pair
approximation except at early times.

We found similar results on many other families of networks, including random regular graphs, random
geometric graphs, scale-free networks, Newman-Watts-Strogatz small world networks, and a social network
of dolphins [29]. Namely, rDMP outperforms the first-order approximation where nodes are independent,
and outperforms the pair approximation across a wide range of parameters and times.

FIG. 6. The fraction of Infectious nodes as a function of time in the SIS model on an Erdős-Rényi graph (inset)
with $n = 100$ and average degree 3. Here $\lambda = 0.4$, $\rho = 0.1$, and the initial condition consists of a single Infectious
node (colored red). Monte Carlo results were averaged over $10^3$ independent runs. Except at early times, rDMP
tracks the true trajectory more closely.

FIG. 7. The SIRS model on the Karate Club. On the left, we show the true and estimated marginal probability that
node 29 is Infectious (main figure) or Recovered (inset) as a function of time. On the right is the average $L_1$ error
for the Infectious and Recovered states. The transmission rate is $\lambda = 0.1$, and the transition rates from Infectious
to Recovered and from Recovered to Susceptible are $\rho = 0.05$ and $\gamma = 0.2$ respectively. Node 0 (colored red) was
initially infected. Monte Carlo results were averaged over $10^5$ runs. As for the SIS model, rDMP is significantly more
accurate than the first-order model where nodes are independent, and is more accurate than the pair approximation
except at early times.




































V. LINEAR STABILITY, EPIDEMIC THRESHOLDS, AND RELATED WORK 


Systems of differential equations for rDMP, such as (3), do not appear to have a closed analytic form
due to their nonlinearities. On the other hand, we can compute quantities such as epidemic thresholds
by linearizing around a stationary point, such as $\{I^*_{j \to i} = 0\}$ where the initial outbreak is small. Given a
perturbation $\epsilon_{j \to i} = I_{j \to i} - I^*_{j \to i}$, the linear stability of the system, i.e., whether or not $\epsilon_{j \to i}$ diverges in time,
is governed by the eigenvalues of the Jacobian matrix $J$ of the right hand side of (3) at the stationary point
$I^*$. The Jacobian for (3) at $\{I^*_{j \to i}\}$ is

$$
J_{(j \to i),(k \to j')} = -\delta_{kj} \delta_{ij'} \rho + \lambda (1 - I^*_j) B_{(j \to i),(k \to j')} , \tag{9}
$$

where

$$
B_{(j \to i),(k \to j')} = \delta_{jj'} (1 - \delta_{ik}) . \tag{10}
$$


This definition of $B$ is another way of saying that the edge $k \to j$ influences edges $j \to i$ for $i \neq k$, but does
not backtrack to $k$. This corresponds to our assumption that infections, for instance, do not bounce from $k$
to $j$ and back again and create an echo chamber effect. For this reason, $B$ is also known in the literature as
the non-backtracking matrix [55] or the Hashimoto matrix.
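
The definition in Eq. (10) can be turned into an explicit matrix as follows. This is an illustrative sketch: the function name, the adjacency-dictionary input, and the dense list-of-lists representation are assumptions made for the example (a sparse format would be preferable on large networks).

```python
def non_backtracking_matrix(adj):
    """Build the non-backtracking matrix B of Eq. (10).

    adj: dict mapping each node to a list of its neighbors (undirected graph).
    Returns (edges, B), where edges lists the 2m directed edges (j, i) and
    B[a][b] = 1 if edge b = (k, j) feeds edge a = (j, i) with k != i.
    """
    edges = [(j, i) for j in adj for i in adj[j]]
    index = {edge: a for a, edge in enumerate(edges)}
    B = [[0] * len(edges) for _ in edges]
    for (j, i), a in index.items():
        for k in adj[j]:
            if k != i:  # exclude the backtracking step i -> j -> i
                B[a][index[(k, j)]] = 1
    return edges, B


# Path graph 0 - 1 - 2: only the two "straight through" steps survive
edges, B = non_backtracking_matrix({0: [1], 1: [0, 2], 2: [1]})
for edge, row in zip(edges, B):
    print(edge, row)
```

Each row of $B$ has $d_j - 1$ nonzero entries, where $d_j$ is the degree of the node $j$ at the tail of the edge $j \to i$.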

Now, for a small perturbation $\epsilon$ away from a stationary point $\{I^*_{j \to i}\}$, the linearized system of (3) becomes

$$
\frac{d\epsilon}{dt} = J \epsilon . \tag{11}
$$

If $J$ has any eigenvalues with positive real part, then $\|\epsilon(t)\|$ grows exponentially in time. So, the fixed point
$\{I^*_{j \to i}\}$ is stable as long as the leading eigenvalue $J_1$ of $J$ has negative real part.

One trivial, but important, stationary point to test is $I^*_{j \to i} = 0$ for all edges. A small perturbation around
0 corresponds to a small initial probability that each node is infected. From (9), $J$ becomes

$$
J = \lambda \left( B - \frac{\rho}{\lambda} \mathbf{1} \right) , \tag{12}
$$

where $\mathbf{1}$ is the $2m \times 2m$ identity matrix. So, the leading eigenvalue of $J$ becomes positive when the largest
eigenvalue $B_1$ of $B$ is greater than $\rho/\lambda$. In other words, if

$$
R_0 = \frac{\lambda}{\rho} B_1 > 1 , \tag{13}
$$

where $R_0$ is the reproductive number, even a small initial probability of infection will lead to a widespread
endemic state, where the infection becomes extensive. If (13) does not hold, a small initial probability of
infection will instead decay back to an infection-less state.
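
The threshold condition (13) only requires the leading eigenvalue $B_1$, which can be estimated without forming $B$ explicitly. The sketch below uses plain power iteration; this is our choice for the example rather than a method prescribed here, and it may converge slowly or fail to settle on graphs where $B$ has several eigenvalues of maximal modulus (bipartite graphs, for instance). The function names are hypothetical.

```python
def leading_nb_eigenvalue(adj, iterations=1000):
    """Estimate B_1, the leading eigenvalue of the non-backtracking matrix.

    adj: dict mapping each node to a list of its neighbors (undirected graph).
    Uses power iteration on a positive start vector, applying B implicitly:
    (B v)_{j->i} = sum over k in neighbors(j), k != i, of v_{k->j}.
    """
    v = {(j, i): 1.0 for j in adj for i in adj[j]}
    estimate = 0.0
    for _ in range(iterations):
        incoming = {j: sum(v[(k, j)] for k in adj[j]) for j in adj}
        w = {(j, i): incoming[j] - v[(i, j)] for (j, i) in v}
        norm = sum(abs(x) for x in w.values())
        if norm == 0.0:
            return 0.0  # e.g. a tree, where B is nilpotent
        estimate = norm / sum(abs(x) for x in v.values())
        v = {edge: x / norm for edge, x in w.items()}
    return estimate


def reproductive_number(adj, lam, rho):
    """R_0 = (lam / rho) * B_1, as in Eq. (13)."""
    return lam / rho * leading_nb_eigenvalue(adj)


# Complete graph on four nodes with lam = 0.1 and rho = 0.05
k4 = {i: [j for j in range(4) if j != i] for i in range(4)}
print(reproductive_number(k4, lam=0.1, rho=0.05))
```

On a tree the iteration returns zero, reflecting the fact that without cycles a non-backtracking signal eventually runs out of edges to follow.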

Since $B$ is not symmetric, not all its eigenvalues are real. However, by the Perron-Frobenius theorem,
its leading eigenvalue is real; moreover, it is upper bounded by $A_1$, the leading eigenvalue of the adjacency
matrix $A$. Interestingly, if we examine the linear stability of the first-order approximation where nodes are
independent [52], the epidemic threshold for the SIS model is given by

$$
\frac{\lambda}{\rho} A_1 > 1 . \tag{14}
$$

Since $B_1 \le A_1$, the threshold (13) gives a better upper bound for the true epidemic threshold than we
would get from the first-order approximation. A similar threshold for the SIR model in sparse networks, or
equivalently for percolation, using $B_1$ was recently demonstrated in [37]. (We note that when backtracking
is allowed, it has important consequences for epidemic thresholds on power-law networks [38].)

Whereas the leading eigenvalue of $B$ governs the epidemic threshold, the spectral gap between $B$’s top
two eigenvalues governs how quickly the epidemic converges to the leading behavior (at least until we leave
the linear regime). Qualitatively, this depends on bottlenecks in the network such as those due to community
structure, where an epidemic spreads quickly in one community but then takes a longer time to cross over
into another. Indeed, the second eigenvector of the non-backtracking matrix $B$ was recently used to detect
community structure [36].

FIG. 8. Same as in Fig. 3 but with transmission rate $\lambda = 0.1$ and waning rate $\rho = 0.54$. A well known upper bound
on the epidemic threshold of the SIS model can be computed from the leading eigenvalue $A_1$ of the adjacency matrix
(the Jacobian matrix of the first-moment-closure approach) of a network. In other words, if $\rho/\lambda < A_1$, it is known from the
first-moment method that an infection-free state becomes unstable and epidemics become widespread and endemic.
Here we show the results from the SIS model in Zachary’s Karate Club, where $A_1 \approx 6.7$. Even though $\rho/\lambda = 5.4 < A_1$, which
is well below the threshold from the first-moment method, the contagion fades away eventually, which is correctly
captured by our DMP approach.

Similarly, just as the leading eigenvector of $B$ was recently shown to be a good measure of importance
or “centrality” of a node [40], it may be helpful in identifying “superspreaders,” i.e., nodes where an initial
infection will generate the largest outbreak, and be the most likely to lead to a widespread epidemic.


VI. CONCLUSION 

Modern epidemiological studies often require recurrent models, where nodes can return to previously
inhabited states multiple times. For example, consider diseases such as influenza where individuals are
infected multiple times throughout their lives, or whooping cough where vaccine effectiveness wanes over 
time; in both cases, individuals return to the Susceptible class. In this paper we have extended Dynamic 
Message-Passing (DMP) to recurrent epidemic models. Our rDMP approach defines messages on the directed 
edges of a network in such a way as to prevent signals, such as the spread of infection, from backtracking 
immediately to the node that they came from. By preventing these “echo chamber effects,” rDMP obtains 
good estimates of the time-varying marginal probabilities on a wide variety of networks, estimating both the 
fraction of infectious individuals in the entire network, and the probabilities that individual nodes become 
infected. 

Like the pair approximation, rDMP takes correlations between neighboring nodes into account. However,
our experiments show that rDMP is more accurate than the pair approximation for a wide variety of network
structures and parameters. Moreover, rDMP is computationally less expensive than the pair approximation,
especially for complex epidemic models with a large number of states, using $O(mk)$ instead of $O(mk^2)$
variables for models with $k$ states on networks with $m$ edges.

Finally, rDMP is conceptually simple, allowing the user to immediately write down the system of differential
equations for a wide variety of epidemic models, such as those with multiple stages of infection or
immunity, or those with multiple interacting diseases. We expect that given its simplicity
and accuracy, it will be an attractive option for future epidemiological studies.